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Shrub willow, Salix spp. and hybrids, is an important bioenergy crop. Here we report the whole-genome sequences and atmota- 
tion of 13 endophytic bacteria from stem tissues of Salix purpurea grown in nature and from commercial cultivars and Salix 
viminalis X Salix miyabeana grown in bioenergy fields in Geneva, New York. 
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Bioenergy produced from plant biomass as a feedstock has the 
potential to mitigate concerns regarding climate change and 
sustainability, among others (1). Shrub willow, Salix spp. and hy- 
brids, is an attractive plant for bioenergy given its rapid growth 
and sustainable growth characteristics (2). There have been nu- 
merous efforts regarding the development of shrub willow culti- 
vars suitable for the production of bioenergy (3). Endophytes have 
been isolated from tissues of tree willows in nature (4), but it is 
unknown what role endophytes might play in shrub willows grow- 
ing in intensively managed bioenergy fields. To gain some insights 
into endophytic bacteria that associate with shrub willow, we em- 
barked on a project which resulted in the identification of 69 
unique bacteria, of which 13 were subjected to whole-genome 
sequencing and annotation. Plant-associated bacteria that are 
beneficial to Salix have the potential to improve crop production 
while reducing need for inputs. In addition, the identification of 



bacteria that are detrimental to growth and development through 
phytopathogenesis is also of interest as a foundation for breeding 
for resistance and maximizing growth potential. The endophytic 
bacteria were initially isolated from surface-sterilized stem tissues 
from wild accessions of S. purpurea growing in nature and from 
commercial cultivars of S. purpurea and S. viminalis X S. miya- 
heana growing in fertilized and nonfertilized bioenergy fields in 
Geneva, NY. The 1 3 endophytes were initially identified by ampli- 
fication and nucleotide sequence analysis of the variable 3 region 
of the 16S rRNAgene(5). 

Genomic DNA was isolated from the endophytes using a 
GenElute bacterial genomic kit (Sigma-Aldrich, St. Louis, MO) and 
prepared for whole-genome sequencing using a Nextera XT library 
preparation kit (lUumina, San Diego, CA). Whole-genome sequenc- 
ing was performed using the lUumina Miseq (150-bp paired-end 
reads). The reads were error corrected and assembled de novo using 



TABLE 1 Sequencing and annotation results for the 13 endophytes isolated from Salix 

Genome 



Strain 


Source" 


SubID 


BioProject no. 


BioSample no. 


Accession no. 


Organism 


coverage 

(X) 


Genome 
size (bp) 


No. of 
contigs 


No. of 
ORFs 


No. of 
tRNAs 


No. of 
rRNAs 


RIT273 


Fabius 


SUB467427 


PRINA239282 


SAMN02676620 


JFOKOOOOOOOO 


Pantoea agglomerans 


138 


5,365,338 


26 


4,914 


75 


17 


RIT283 


Fabius 


SUB467430 


PRJNA239283 


SAMN02676621 


JFOJOOOOOOOO 


Staphylococcus 


289 


2,527,922 


81 


2,445 


49 


14 














haemolyticus 














RIT288 


Fish Creek 


SUB468060 


PRJNA239284 


SAMN02676622 


JFYNOOOOOOOO 


Pseudomonas sp. 


191 


6,273,290 


44 


5,547 


61 


10 


RIT293 


Fabius 


SUB468074 


PRJNA239285 


SAMN02676623 


JFYOOOOOOOOO 


Microbactertum 


194 


2,898,622 


11 


2,782 


49 


8 














oleivorans 














RIT304 


Wild 


SUB468076 


PRJNA239286 


SAMN02676624 


JFYPOOOOOOOO 


Micrococcus luteus 


166 


2,506,829 


183 


2,248 


53 


4 


RIT305 


Wild 


SUB468079 


PRJNA239287 


SAMN02676625 


JFYQOOOOOOOO 


Micrococcus luteus 


200 


2,612,381 


110 


2,350 


49 


6 


RIT308 


Fabius 


SUB468082 


PRINA239288 


SAMN02676626 


JFYROOOOOOOO 


Janthinobacterium 


189 


6,212,741 


44 


5,431 


83 


19 














lividum 














RIT309 


Fabius 


SUB468084 


PRJNA239289 


SAMN02676627 


JFYSOOOOOOOO 


Stenotrophomonas 


194 


4,634,795 


45 


4,141 


77 


6 


RIT313 


Fabius 


SUB468085 


PRJNA239290 


SAMN02676628 


JFYTOOOOOOOO 


sp. 

Delftia sp. 


56 


6,698,360 


122 


5,936 


78 


5 


RIT324W 


Fabius 


SUB468086 


PRJNA239291 


SAMN02676629 


(FYUOOOOOOOO 


Micrococcus luteus 


459 


2,635,230 


118 


2,381 


53 


7 


RIT328 


Fish Creek 


SUB468089 


PRINA239292 


SAMN02676630 


JFYVOOOOOOOO 


Sphingomonas sp. 


73 


4,343,511 


56 


4,002 


58 


3 


RIT341 


Fabius 


SUB468091 


PRJNA239293 


SAMN02676631 


JFYWOOOOOOOO 


Exiguobacterium sp. 


138 


3,107,022 


15 


3,168 


62 


20 


RIT357 


Wild 


SUB468093 


PRJNA239294 


SAMN02676632 


JFYXOOOOOOOO 


Pseudomonas sp. 


69 


3,107,022 


49 


5,552 


60 


7 



" Wild, wild Salix purpurea; Fish Creek, Salix purpurea Fish Creek cultivar; Fabius, Salix viminalis X S. miyabeana Fabius cultivar. 
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Spades 2.5 (6). Scaffolding of the contigs and in silico gap-closing of 
the resulting scaffolds were performed with SSPACE and GapFUler, 
respectively (7, 8). Genome annotation was performed using the 
Prokka annotation pipeline, which incorporated Prodigal 2.60, Ara- 
gorn, and RNAmmer 1.2 for the prediction of open reading frames 
(ORFs), tRNAs, and rRNAs, respectively (9-11). Additional annota- 
tion of the predicted protein sequences was done using Inter- 
ProScanS (12). The key attributes for the genome sequences and an- 
notation are summarized in Table 1 . An in-depth analysis of the genes 
associated with plant-microbe symbiosis is under way and will be 
published in a subsequent report. 

Nucleotide sequence accession numbers. The nucleotide se- 
quences have been deposited at DDBJ/EMBL/GenBank under the 
accessions numbers provided in Table 1 . 
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